Utility function security in artificially intelligent agents

Author

Roman V. Yampolskiy

Abstract

The notion of ‘wireheading’, or direct reward centre stimulation of the brain, is a well-known concept in neuroscience. In this paper, we examine the corresponding issue of reward (utility) function integrity in artificially intelligent machines. We survey the relevant literature and propose a number of potential solutions to ensure the integrity of our artificial assistants. Overall, we conclude that wireheading in rational self-improving optimisers above a certain capacity remains an unsolved problem, despite the opinion of many that such machines will choose not to wirehead. A relevant issue of literalness in goal setting also remains largely unsolved, and we suggest that the development of a non-ambiguous knowledge transfer language might be a step in the right direction.

Similar resources

Entropy-based Consensus for Distributed Data Clustering

The increasingly larger scale of available data and the more restrictive concerns on their privacy are some of the challenging aspects of data mining today. In this paper, Entropy-based Consensus on Cluster Centers (EC3) is introduced for clustering in distributed systems with a consideration for confidentiality of data; i.e. it is the negotiations among local cluster centers that are used in t...

Self-Modification and Mortality in Artificial Agents

This paper considers the consequences of endowing an intelligent agent with the ability to modify its own code. The intelligent agent is patterned closely after AIXI with these specific assumptions: 1) the utility function is an integrated part of the agent's code, and 2) the environment has read-only access to the agent's code. On the basis of some simple modifications to the utility and horiz...

An ECC-Based Mutual Authentication Scheme with One Time Signature (OTS) in Advanced Metering Infrastructure

Advanced metering infrastructure (AMI) is a key part of the smart grid; thus, one of the most important concerns is to offer a secure mutual authentication. This study focuses on communication between a smart meter and a server on the utility side. Hence, a mutual authentication mechanism in AMI is presented based on the elliptic curve cryptography (ECC) and one time signature (OTS) consists o...

A Society of Self-organizing Agents in the Intelligent Home

A concept of the intelligent home is presented which is based on the idea that the functioning of the intelligent home emerges from the cooperation of different agents, each representing a device of the home. The agents are grouped together in systems, each of which realizes a certain function of the intelligent home. The security system is taken as an example of how these systems may be...

Enhancing intelligent agents with episodic memory

For a human, episodic memory is a memory of past experiences that one gains over a lifetime. While episodic memory appears critical to human function, researchers have done little to explore the potential benefits for an artificially intelligent agent. In this research, we have added a task-independent, episodic memory to a cognitive architecture. To frame the research, we propose that episodic...

Journal:

J. Exp. Theor. Artif. Intell.

Volume 26, Issue

Pages -

Publication date: 2014
